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ABSTRACT: Comprehensive information on genetic diversity and introgression is desirable for the design of rational breed 
improvement and conservation programs. Despite the concerns regarding the genetic introgression of Western pig breeds into the gene 
pool of the Korean native pig (KNP), the level of this admixture has not yet been quantified. In the present study, we genotyped 93 
animals, representing four Western pig breeds and KNP, using the porcine SNP 60K BeadChip to assess their genetic diversity and to 
estimate the level of admixture among the breeds. Expected heterozygosity was the lowest in Berkshire (0.31) and highest in Landrace 
(0.42). Population differentiation (Fgx) estimates were significantly different (p<0.000), accounting for 27% of the variability among the 
breeds. The evidence of inbreeding observed in KNP (0.029) and Yorkshire (0.031) may result in deficient heterozygosity. Principal 
components one (PCI) and two (PC2) explained approximately 35.06% and 25.20% of the variation, respectively, and placed KNP 
somewhat proximal to the Western pig breeds (Berkshire and Landrace). When K = 2, KNP shared a substantial proportion of ancestry 
with Western breeds. Similarly, when A" = 3, over 86% of the KNP individuals were in the same cluster with Berkshire and Landrace. 
The linkage disquilbrium (LD) values at / q ^, the physical distance at which LD decays below a threshold of 0.3, ranged from 72.40 kb 
in Landrace to 85.86 kb in Yorkshire. Based on our structure analysis, a substantial level of admixture between Western and Korean 
native pig breeds was observed. (Key Words: Admixture, Genetic Diversity, Korean Native Pig, Single Nucleotide Polymorphism) 



INTRODUCTION 

Evidence supports the independent domestication of 
pigs in Asia and Europe from wild boar subspecies (Giuffra 
et al., 2000; Kijas and Andersson, 2001; Larson et al., 2005). 
The majority of the world's pig genetic resources comprise 
indigenous types. Native or local pig breeds harbor unique 
genetic variants that may provide future breeds with the 
flexibility to change in response to product market 
preferences, production environments, and breeding goals. 
Despite the key role played by indigenous breeds in 
marginal environments, pig improvement and development 
programs have focused on several intensively selected 
Western (European) pig breeds, leaving locally adapted 
indigenous breeds unsuitable for intensive commercial 
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breeding due mainly to their slow growth rates, light 
carcass weights, low feed conversion efficiencies, and low 
lean meat production levels (Yang et al., 2003). As a 
consequence of indiscriminate crossbreeding and breed 
substitution, over 18% of the world's pig breeds are 
experiencing genetic erosion (Nidup and Moran, 2001; 
Food and Agriculture Organization of the United Nations 
(FAO, 2007), while the status of the vast majority of breeds 
remains unknown. For example, many commercial breeds 
introduced into China have been shown to contribute to the 
genetic pools of most indigenous breeds (Yang et al., 2003). 

Similarly, for the last two decades. Western pig breeds 
have been imported into the Korean peninsula and crossed 
with the Korean native pig (KNP) to improve its growth 
and carcass-related traits, which has led to a dramatic drop 
in the KNP population (Yeo et al., 2000; Kim et al., 2005). 
Small effective population sizes can result in the loss of 
genetic diversity, which can in turn lead to reduced 
individual fitness and response to selection (Ficetola et al., 
2007). Although commercial pig breeds are superior in 
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terms of growth and feed efficiency traits, the KNP is 
known for its better meat quality (high redness and 
intramuscular fat), and greater ability to thrive under low- 
management conditions (Yeo et al., 2000; Park et al., 2007; 
Kim et al., 2011). Despite the concerns regarding the loss of 
genetic diversity from and introgression of Western pig 
breeds into the KNP, the level of admixture at the whole- 
genome level remains unclear. 

Comprehensive information on genetic diversity and 
introgression is desirable for the design of rational breed 
improvement and conservation programs. Several previous 
studies have examined the genetic diversity of KNP. 
However, these studies have focused exclusively on limited 
numbers of microsatelhte markers (Kim and Choi, 2002; 
Kim et al., 2005) and mtDNA (Kim et al., 2002a). Over the 
past several years, through the advancement of high- 
throughput sequencing technology, thousands of single 
nucleotide polymorphisms (SNPs) have been discovered, 
creating unprecedented opportunities for genome-wide 
genetic diversity analysis, genetic mapping, genome-wide 
association studies, and marker-assisted selection (Duran et 
al., 2009). SNPs embody the main source of genetic 
variation, have low mutation rates, and are associated with 
complex heritable traits (Suh and Vijg, 2005). 

As in cattle and sheep, thousands of SNPs have been 
identified in pigs using next-generation sequencing 
technology (Ramos et al., 2009). The lUumina Porcine 
SNP60K BeadChip allows for more precise and 
comprehensive genome-wide investigations of genetic 
diversity and admixture levels among pig breeds than could 
be performed using smaller and lower-density microsatellite 
markers. The chip can also be used to estimate the level of 
linkage disequilibrium (LD) at the genome level. This 
ability will further help to refine previous investigations 
conducted using molecular markers of limited number and 
density. To the best of our knowledge, this study is the first 
to employ SNPs to investigate the genetic diversity of and 
quantify the level of European introgression into KNP. In 
this study, we genotyped 93 animals, representing four 
Western pig breeds and a KNP population, using the high- 
density Porcine 60K SNP BeadChip to explore the genetic 
diversity within and between breeds and to estimate the 
level of admixture between Westem and Korean native pig 
breeds. 

MATERIALS AND METHODS 

Breeds, sampling, and genotyping 

Samples were collected from a total of 93 animals, 
representing four Western pig breeds, Berkshire (BK, n = 
30), Duroc (DU, n = 24), Landrace (LR, n = 6), and 
Yorkshire (YK, n = 17), and KNP (n = 16). All DNA 



samples were obtained from the National Institute of 
Animal Science (NIAS) in Korea and commercially 
genotyped using the Porcine SNP 60 BeadChip according to 
the Illumina protocols (http://www.illumina.com). Common 
monomorphic SNPs for all of the breeds were discarded 
from further analyses. Autosomal SNPs were filtered with 
the criteria of call rate (>90%), minor allele frequency 
(>5%), and Hardy Weinberg equilibrium (>0.001). A final 
total of approximately 31,755 SNPs was considered for the 
study. 

Analyses of molecular variance 

Analyses of molecular variance (AMOVA) were 
conducted using the program Arlequin (Excoffier et al., 
2005) to determine the genetic variance within and among 
breeds. To assess genetic diversity, observed heterozygosity 
(Ho) and expected heterozygosity (He) were calculated 
within each breed. Fixation indices were estimated 
according to Weir and Cockerham (1984) using the same 
package. The significance of the fixation indices was 
determined using permutation tests (1,000 permutations). 
The significance of the pairwise population differentiation 
values was similarly determined by permutation testing 
(1,000 permutations). Reynolds genetic distance (Reynolds 
et al., 1983) was calculated using PowerMarker (Liu and 
Muse, 2005). 

Population admixture 

For the analysis of population structure, a Bayesian 
model-based analysis was performed using structure 2.3.4 
(Pritchard et al., 2000). This software assumes a model in 
which there are K populations (clusters) that contribute to 
the genotype of each individual and are characterized by a 
set of allele frequencies at each marker locus. The method 
attempts to assign individuals to populations based on their 
genotypes while simultaneously estimating progenitor 
population allele frequencies. A Monte Carlo Markov chain 
method was used to estimate the allele frequencies in each 
of the K populations and the degree of admixture for each 
individual animal. The number of clusters was inferred 
using five independent runs with 100,000 iterations and a 
burn-in period of 20,000 iterations, following the admixture 
ancestry model and the correlated allele frequencies for K 
values ranging from 2 to 5. The pattern of breed admixture 
was further analyzed through principal component analysis 
(PCA) using the SNP and variation suite version 7 (Golden 
Helix, Inc., Bozeman, MT, USA www.goldenhelix.com). 
The PCA was conducted to determine the breed 
relationships based directly on allele frequencies using a 
multivariate method, which condenses the information from 
a large number of alleles and loci into a few synthetic 
variables. 
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leneUc diversity and their standard deviations in Korean native pig and Western pi 






Breed 


MAP 


Ho 


He 


Gene diversity 


Fis 


HWE 


Berkshire 


0.18 (0.16) 


0.32(0.17) 


0.31 (0.15) 


0.25 (0.12) 


-0.007 


2.04 (648) 


Duroc 


0.19(0.16 


0.36 (0.16) 


0.35 (0.14) 


0.25 (0.12) 


-0.026 


1.67 (531) 


Korean native pig 


0.18(0.16) 


0.32 (0.17) 


0.33 (0.15) 


0.25 (0.12) 


0.029 


2.37 (754) 


Landrace 


0.19 (0.16) 


0.42 (0.22) 


0.39 (0.13) 


0.28(0.14) 


-0.086 


0.30 (95) 


Yorksiiire 


0.21 (0.16) 


0.35 (0.17) 


0.36 (0.14) 


0.28(0.13) 


0.031 


3.61 (1,147) 



MAF, minor allele frequency; Ho, observed heterozygosity; He, expected heterozygosity; Fjs, within-population deficit in heterozygosity; HWE, 
proportion of SNPs showing deviation from Hardy- Weinberg equiUbrium (p<0.05). 



Linkage disequilibrium analysis 

Among the several measures of LD, and \D'\ are 
usually used to estimate the extent of LD (Hill, 1981). The 
LD estimation using \D'\ is particularly useful for 
determining LD at longer distances and is known to be 
robust for smaller sample sizes (Chen et al., 2006; Khatkar 
et al., 2008). We estimated linkage disequilibrium using 
both \D'\ and between adjacent SNPs with the SNP and 
variation suite version 7 (Golden Hehx, Inc., USA). 

RESULTS 

Within-breed genetic diversity 

Genetic variability parameters are presented in Table 1 
for the four Western pig breeds and the KNP population. 
The average Ho ranged from 0.32+0.17 in BK and KNP to 
0.42+0.22 in LR. The He was the lowest in BK (0.31+0.15) 
and highest in LR (0.39+0.13). The average within-breed 
inbreeding was found to be -0.003 and was not 
significantly different from zero (p>0.05). Deficient 
heterozygosity was observed in both KNP and LR, resulting 
in inbreeding coefficients of 0.029 and 0.031, respectively. 
Significant deviation (p<0.05) from the Hardy-Weinberg 
equilibrium was detected in YK (3.6%) and KNP (2.37%) 
for the total number of SNPs (31,755). 

Genetic distances, differentiation, and admixture 

The levels of breed differentiation can be quantified 
using fixation indices. Population differentiation (Fst) 
estimates were significantly different (p<0. 00001) from 
zero, with an overall value of 0.27. Reynolds and Fst values 
among the five pig breeds are given in Table 2. The average 
Fst and Reynolds genetic distances between the Western pig 
breeds and KNP were 0.27 and 0.31, respectively. 
Divergence (Fst) was the lowest between LR and YK (0.20), 



while the largest divergence (0.30) was detected between 
KNP and DU. Similarly, the lowest (0.25) and highest 
(0.36) Reynolds genetic distances were observed between 
KNP and DU animals. To quantify the levels of within- and 
between-population variation, we conducted AMOVAs, and 
the results revealed that approximately 73% of the total 
variation consisted of within-breed variation and the 
remaining 27% of between-breed variation. 

The patterns of population structure were investigated 
using Structtire and PC A. As shown in Figure 1, principal 
components one (PCI) and principal components two (PC2) 
explained approximately 35.06% (Eigenvalue = 7.89) and 
25.20% (Eigenvalue = 5.67) of the total variation, 
respectively. The results indicate clear separations of DU 
and YK from the remaining breeds by PCI and PC2, 
respectively. Excepting 13% of the sampled individual 
animals, DU pigs were separately clustered from the 
remaining breeds. The Bayesian approach of Structure 
analysis for inferred populations with K values from 2 to 5 
is shown in Figure 2. When K=2, with the exception of the 
DU animals, all of the breeds shared a conmion cluster 
(91%). Only 13.20% of KNP and 18.6% of LR individuals 
shared common ancestry with DU. Surprisingly, when K = 
3, over 86% of KNP individuals were in a common cluster 
with YK and LR, while LR was the most admixed breed, 
with 60%, 24%, and 16% of its individuals in clusters with 
YK, BK, and DU, respectively. When four clusters were 
inferred, the majority (91%) of the breeds were assigned 
into separate groups, with the exception of LR, which 
showed consistent admixture with YK (43%) and BK (25%). 
When K = 4 or 5, approximately 91% of the KNP 
individuals were clustered into a separate group (Table 3). 

Linkage disequilibrium analysis 

The LD between adjacent SNP pairs was estimated 



Table 2. Reynolds (above diagonal) genetic distances and genetic differentiation (Fst) (below diagonal) among the pig breeds 



Breed Berkshire Duroc Korean native Landrace Yorkshire 



Berkshire 




0.33 


0.28 


0.25 


0.27 


Duroc 


0.28 




0.36 


0.32 


0.36 


Korean native pig 


0.24 


0.30 




0.30 


0.31 


Landrace 


0.22 


0.27 


0.26 




0.22 


Yorkshire 


0.24 


0.30 


0.26 


0.20 
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Figure 1. Breed clustering based on principal component analysis. PCI, principal components one; PC2, principal components two; PC3, 
principal components three. 



using both and r' for overall autosomes. The mean 

value of varied between 0.25 to 0.27, while the mean 
value for | D'| ranged from 0.48+0.49 in LR to 0.59+0.46 
in BK and YK (Table 4). An inverse relationship was 
observed between the LD estimates and distances between 
adjacent SNPs (Table 5). For example, the averge and 
I D'l values at 20 to 40 kb were 0.26 and 0.72, respectively, 
while at 500 to 1 Mb, they dropped to 0.10 and 0.53, 
respectively. Compare to that of \D'\, the decline of r with 
increasing distance between adjacent SNPs was much 
steeper The LD values at r' oj, the physical distance at 
which LD decays below a threshold of 0.3, ranged from 



72.40 kb in LR to 85.86 kb in YK. The average LD 
extended to moderate levels (r^ = 0.21) up to 60 to 100 kb. 

DISCUSSION 

In this study, we estimated the genetic variability, 
structure, and admixture in four Western pig breeds and one 
KNP population. Observed heterozygosity was highest in 
LR (0.42) and lowest in KNP and BK (0.32). In comparison, 
lower expected heterozygosity values of 0.26 and 0.27 have 
been reported for Chinese Jinhua and Ganxi pig breeds, 
respectively, from SNP data (Ai et al., 2013). However, 



K = 2 



K=3 



K=i 



K = 5 




Berkshire 



Duroc 



Korean native pig Landrace 



Yorkshire 



Figure 2. Summary of estimate plots of Q for K = 5 in the five pig breeds. Each individual is represented by a single column divided into 
K colored segments, where K is the number of clusters assumed, with lengths proportional to each of the K inferred clusters. Black lines 
separate the populations indicated below the figure. 
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Table 3. Proportions of membership of each predefined 
population at each of the K values, ranging from 2 to 5 clusters 



Breed 



Inferred clusters 



1 



Berkshire 
Duroc 

Korean native pig 

Landrace 

Yorkshire 

Berkshire 

Duroc 

Korean native pig 

Landrace 

Yorkshire 



0.009 
1 

0.13 
0.19 
0.03 



1st- 



0.961 
0.003 
0.088 
0.241 
0.025 



Berkshire 


0.019 


0.954 


Duroc 


0.002 


0.001 


Korean native pig 


0.913 


0.030 


Landrace 


0.143 


0.252 


Yorkshire 


0.006 


0.034 



0.991 
0.001 
0.868 
0.814 

0.972 

- 2nd 

0.027 0, 
0.000 
0.864 
0.600 
0.975 

3rd - 

0 
0 
0, 
0, 
0, 



012 

997 
,048 
159 
000 



Berkshire 
Duroc 

Korean native pig 

Landrace 

Yorkshire 



0.956 
0.003 
0.029 
0.036 
0.011 



0.015 0 

0.000 0, 

0.907 0 

0.002 0, 

0.001 0, 



.012 

.997 

.039 

.172 

.002 

4th- 

.011 

.997 

.037 

.019 

.000 



0.015 
0.000 
0.018 
0.433 
0.958 



0.012 
0.000 
0.009 
0.014 
0.876 



0.006 
0.000 
0.019 
0.929 
0.112 



higher Hg (0.40) was observed in the Chinese Kele pig in 
the same study. With exception of BK, KNP displayed a 
slightly lower level of heterozygosity than did the Western 
pig breeds, which was in agreement with the previous 
microsatellite- based studies (Kim and Choi, 2002; Kim et 
al., 2005), while F/^was negative in the Western breeds, 
implying that they are outbred. The observed heterozygosity 
deficiency in KNP may be attributed to inbreeding (Fjs = 
0.029). The within-breed genetic variability observed in this 
study was lower than that obtained in the microsatellite- 



Table 4. Average LD estimates in the five pig breeds between 
adjacent SNPs pairs 



Breed 



D 



r O.J (Kb) 



Berkshire 
Duroc 

Korean native pig 

Landrace 

Yorkshire 



0.25±0.35 
0.26±0.37 
0.25+0.35 
0.25±0.36 
0.27±0.36 



0.59+0.46 
0.53+0.48 
0.56±0.48 
0.48+0.49 
0.59+0.46 



73.93 
75.64 
76.24 
72.40 
85.86 



LD, linkage disquilbrium; SNPs, single nucleotide polymorphisms. 
The and \D'\ are measures of correlation coefficient of LD. 



based experiments (Kim et al., 2005; Nidup and Moran, 
2001). These authors found expected heterozygosity values 
of 0.494, 0.566, 0.599, 0.702, and 0.637 for KNP, BK, DU, 
LR, and YK, respectively, using microsatellite analysis. 
Yang et al. (2003) found expected heterozygosity values of 
0.70 and 0.87 in Mashen and Guanling Chinese pig breeds, 
respectively, by analyzing 26 microsatelUtes. The 
differences in the results obtained using SNP and 
microsatellite markers may reflect the multiallelic nature of 
microsatellite markers. 

Both FsT and Reynolds genetic distances revealed 
significant genetic differences among the pig breeds 
investigated. The lowest genetic differentiation (0.20) and 
Reynolds distances (0.22) were observed between two 
Western pig breeds (YK and LR). An average Fst value of 
0.26, which is in line with our results, has been reported 
between Asian and European pig breeds (Kim et al., 2005), 
but this is lower than the value (0.45) previously found 
between Chinese and Western pig breeds (Ai et al., 2013). A 
lower genetic differentiation (0.21) than that of the present 
results has previously been observed among Western pig 
breeds (Nidup and Moran, 2011). The inbreeding 
coefficient of 0.029 detected for KNP in this study was 
higher than that previously reported for the same population 
(-0.007) from an analysis of 16 microsatellite markers (Kim 
et al., 2005). A high level of inbreeding (Fjs) was also 
detected in this previous study (Kim et al., 2005). This 
result may be due to small population size and the mating of 



Table 5. Mean LD between adjacent SNPs over different map distances 



Distance (kb) 




SD 


Median 


D' 


SD 


Median 


0 


0.56 


0.39 


0.57 


0.88 


0.28 


0.99 


0-1 


0.55 


0.39 


0.56 


0.87 


0.27 


0.99 


1-10 


0.30 


0.32 


0.15 


0.77 


0.34 


0.99 


10-20 


0.29 


0.32 


0.16 


0.75 


0.36 


0.99 


20-40 


0.26 


0.30 


0.13 


0.72 0.36 


0.99 


40-60 


0.24 


0.28 


0.11 


0.70 0.37 


0.97 


60-100 


0.21 


0.27 


0.09 


0.67 0.38 


0.90 


100-200 


0.19 


0.25 


0.07 


0.64 0.39 


0.81 


200-500 


0.16 


0.23 


0.05 


0.58 


0.40 


0.69 


500-1 Mb 


0.10 


0.17 


0.03 


0.53 


0.39 


0.49 



LD, linkage disquilbrium; SNPs, single nucleotide polymorphisms; SD, standard deviation. 
The and Id'I are measures of correlation coefficient of LD. 
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related individuals. 

The separation of DU from the remaining breeds at 
assumed K values from 2 to 5 was supported by PCA. The 
close genetic relationship and admixture displayed between 
LR and YK was also captured by both PCA and Structure 
results. The estimation of LD can be used to map 
quantitative trait loci, and a decrease in LD as the distance 
between SNPs increased has been noted in previous studies 
of Western pig breeds (Nsengimana et al., 2004). The oj 
was lowest in LR and highest in YK. This value was lower 
than that previously reported for Western pig breeds (125 
kb) but higher than those reported for Chinese pig breeds 
(10.5 kb) and wild boars (38 kb) (Ai et al., 2013). In 
humans, LD extends up to 50 kb (Abecasis et al., 2001). 
The lower LD at the thereshold of = 0.3 observed in this 
study may be related to reduced effective population size, as 
most commericial breeds have undergone long periods of 
intesive selection. Similarly, the effective population size of 
KNP may be reduced because of the small number of 
animals used as foundation stock. The differences between 
the previous results and those of the present study may be 
justified by differences in sample size, marker density, or 
population demography. 

Based on our Structure anaysis, we conclude that there 
is a substantial introgression of Western pig breeds into the 
KNP genome. Considering its better meat quality and 
adaptabilty to low-management conditions (Park et al., 
2007; Kim et al., 2011), KNP should be conserved and 
improved for sustainable utilization. The higher inbreeding 
value observed in KNP could potentially reduce its genetic 
variabilty; therefore, a proper breeding strategy must be 
applied to minimize the mating of related individuals. The 
low LD detected in our analysis highhghts the necessity of 
high-density markers for quantitative trait loci (QTL) 
mapping and marker-assisted selection. In this study, a 
moderate LD (r^ = 0.2) was detected, extending only to 20 
to 40 kb. To confirm the results reported here, further 
investigation of the magnitude of LD in large samples using 
high-density SNP markers is required, which will bolster 
furture QTL mapping and marker-assisted selection. 
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